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Abstract. Wc put forward a new take on the logic of quantum mechanics, following 
Schrddingcr's point of view that it is composition which makes quantum theory what it is, 
rather than its particular prepositional structure due to the existence of superpositions, 
as proposed by Birkhoff and von Neumann. This gives rise to an intrinsically quantitative 
kind of logic, which truly deserves the name 'logic' in that it also models meaning in 
natural language, the latter being the origin of logic, that it supports automation, the 
most prominent practical use of logic, and that it supports probabilistic inference. 



§1. The physics and the logic of quantum-ish logic. In 1932 John von 
Neumann formalized Quantum Mechanics in his book "Mathematische Grund- 
lagen der Quantenmechanik" . This was effectively the official birth of the quan- 
tum mechanical formalism which until now, some 75 years later, has remained 
the same. Quantum theory underpins so many things in our daily lives in- 
cluding chemical industry, energy production and information technology, which 
arguably makes it the most technologically successful theory of physics ever. 

However, in 1935, merely three years after the birth of his brainchild, von 
Neumann wrote in a letter to American mathematician Garrett Birkhoff : "I 
would like to make a confession which may seem immoral: I do not believe 
absolutely in Hubert space no more." (sic) — for more details see [7Tj . 

Soon thereafter they published a paper entitled "The Logic of Quantum Me- 
chanics" |13) . Their 'quantum logic' was casted in order-theoretic terms, very 
much in the spirit of the then reigning algebraic view of logic, with the distribu- 
tive law being replaced with a weaker (ortho) modular law. 

This resulted in a research community of quantum logicians [311 1481 1661 169j . 
However, despite von Neumann's reputation, and the large body of research that 
has been produced in the area, one does not find a trace of this activity neither 
in the mainstream physics, mathematics, nor logic literature. Hence, 75 years 
later one may want to conclude that this activity was a failure. 

What went wrong? 

1.1. The mathematics of it. Let us consider the raison d'etre for the 
Hilbert space formalism. So why would one need all this 'Hilber space stuff', 
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... then matter ... then speech", "How computer science helps bringing quantum mechanics to 
the masses" , "Selling categories to the masses" , or the actual title of this paper itself. 
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i.e. the continuum structure, the field structure of complex numbers, a vector 
space over it, inner-product structure, etc. Why? According to von Neumann, 
he simply used it because it happened to be 'available'. The use of linear algebra 
and complex numbers in so many different scientific areas, as well as results in 
model theory, clearly show that quite a bit of modeling can be done using Hilbert 
spaces. On the other hand, we can also model any movie by means of the data 
stream that runs through your cables when watching it. But does this mean 
that these data streams make up the stuff that makes a movie? Clearly not, 
we should rather turn our attention to the stuff that is being taught at drama 
schools and directing schools. Similarly, von Neumann turned his attention to 
the actual physical concepts behind quantum theory, more specifically, the no- 
tion of a physical property and the structure imposed on these by the peculiar 
nature of quantum observation. His quantum logic gave the resulting 'algebra of 
physical properties' a privileged role. All of this leads us to ... 

1.2. ... the physics of it. Birkhoff and von Neumann crafted quantum logic 
in order to emphasize the notion of quantum superposition. In terms of states of 
a physical system and properties of that system, superposition means that the 
strongest property which is true for two distinct states is also true for states other 
than the two given ones. In order-theoretic terms this means, representing states 
by the atoms of a lattic^Jof properties [ST], that the join p V q of two atoms p 
and q is also above other atoms. From this it easily follows that the distributive 
lavij^] breaks down: given atonj^j r 7^ p,q with r < p V q we have r A (p V q) = r 
while (r A p) V (7' A q) — V = 0. Birkhoff and von Neumann as well as many 
others believed that understanding the deep structure of superposition is the 
key to obtaining a better understanding of quantum theory as a whole. But as 
already mentioned, 75 years later quantum logic did not break through. 

The Achilles' heel of quantum logic is the fact that it fails to elegantly capture 
'composition of quantum systems', that is, how do we describe multiple quantum 
systems given that we know how to describe the individual quantum systems. On 
the other hand, also in 1935, Schrodinger pushed forward the idea that the stuff 
which truly characterizes quantum behavior is precisely the manner in which 
quantum systems compose [72] . Over the past 30 years or so we have seen ample 
evidence for this claim. So-called 'quantum non-locality' was experimentally 
confirmed, and the focus on quantum information processing has revealed a wide 
range of quantum phenomena which all crucially depend on the manner in which 
quantum systems compose, most notably exponential quantum computational 
speed-up which led to the quantum computing paradigm |77j . 

Now reversing the roles, rather than explaining all of quantum theory in terms 
of superposition, can we maybe explain all of quantum theory in term of the 
manner in which quantum systems compose, including superposition? 

1.3. The game plan. Here is the list of tasks we've set ourselves: 

I.e. a partially ordered set with a minimal element and maximal element 1, and in which 
each pair of elements has a supremum and an infimum. In fact, there are physical resons for 
assuming that this lattice is complete 67 69], i.e. arbitrary suprema and infima exist. 

2 Distributivity means that for any elements a, b, c of the lattice we have that a A (b V c) = 
(a A b) V (a A c) and that a V (b A c) = (a V b) A (a V c). 

3 An atom is an element p 7^ which is such that whenever a < p then a = 0. 
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• Task 0. First we want to solve: 

tensor product structure 
the other Hilbert space stuff 

that is, we want to know what remains of the Hilbert space formalism if 
we 'remove all of its structure except for the manner in which systems 
compose'. In other words, we want to axiomatize composition of systems, 
which we denote by ®, without any reference to underlying spaces. 

• Task 1. Next we investigate which additional assumptions on ® are needed 
in order to deduce experimentally observed phenomena? That is, given that 
the structure deduced in Task applies to a wide range of theories (as we 
shall see below in Section [2| what extra structure do we need to add such 
that the resulting framework allows us to derive typical quantum behaviors. 

• Task 2. Once this 'typically quantum' structure has been identified, we 
take on the challenge to find this same structure elsewhere in what we 
usually conceive as 'our classical reality'. This may involve looking at this 
classical reality through a 'novel pair of glasses'. 

And, ... here are the resulting outcomes: 

• Outcome 0: That was an easy one. The solution to this has been around 
for quite a while. It is called symmetric monoidal category [11] , In fact, as 
discused in [24j [33] , physical processes themselves form a strict symmetric 
monoidal category, while set theory based models such as the Hilbert space 
model are typically non-strict, which invokes so-called 'coherence condi- 
tions' between 'natural transformations' [46J . But one can show that 
an arbitrary symmetric monoidal category is always 'categorically equiv- 
alent' to a strict symmetric monoidal category, which means that, up to 
isomorphisms, whatever one can do with a non-strict one, one can do with a 
strict one too. Hence, here we will only spell out strict symmetric monoidal 
categories, in terms of their graphical language [57, 68 , that is, a language 
which is such that an equational statement holds in it if and only if it follows 
from the axioms of a strict symmetric monoidal category. 

• Outcome la: Quoting Princeton philosopher Hans Halvorson in his edito- 
rial to the volume Deep Beauty: Understanding the Quantum World through 
Mathematical Innovation which marked 75 years since the publications of 
von Neumann's quantum formalism |52j : "What is perhaps most striking 
about Coecke's approach is the sheer ratio of results to assumptions." As 
we shall see below, with very little additional structure one can already de- 
rive a wide range of quantum phenomena, and the required computations 
are utterly trivial. This is in sharp contrast with Birkhoff-von Neumann 
quantum logic where one couldn't derive much; and in the case that one 
could derive something physically relevant one had to work really hard. 

• Outcome lb: Moreover, exposing this structure has already helped to 
solve standing open problems in quantum information, e.g. [141 1451 155] . 
and provided novel insights in the nature of quantum non- locality [271 128j . 

• Outcome lc: The diagrammatic framework underpinning strict symmet- 
ric monoidal categories has meanwhile been adopted by several leading 
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researchers in quantum foundations e.g. |171 1181 [55] ; quoting Lucien Hardy 
in [53] : "[...] we join the quantum picturalism revolution |24j" . 
• Outcome 2a: Observe the following similar looking pictures: 
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These are respectively taken from a physics paper on the flow of information 
in quantum protocols [TJ [211 123]; a linguistics paper on how to compute 
the meaning of a sentence given the meaning of its words [501 EH]> an d 
a probability theory paper that axiomatizes Bayesian inference [39] . The 
graphical calculi are in each case very similar, which points at a common 
reasoning systems in each of these very distinct areas. Note in particular 
that in each case the data of interest is of a fundamentally quantitative 
nature. Could this be pointing at the existence of some sort of quantitative 
logic, which is not typical to these areas but of a more universal nature? 

So let us now consider ... 

1.4. ... the logic of it. What is logic? The previous century has known 
a huge proliferation of logics of various kinds, and there probably are as many 
opinions of what logic actually is. Rather than making a case for one or another 
logical paradigm we will take a pragmatic stance and conceive logic in terms of 
its origin and its most prominent practical use: 

• Origin: structure in natural language. The origin of logic, tracing 
back to Aristotle, is that it is about 'arguments in natural language'. Con- 
sider for example the sentence: "Alice and Bob either ate everything or 
nothing, then got sick." By using connectives, quantifiers, variable / refer- 
ring to food, constants a(lice) and &(ob), and predicates Sicfc(person) and 
Eat(j>erson, some kind of food) we can formalize this as follows: 

(V/ : Eat(aJ) A Eat(b,f)) V -(3/ : Eat{aJ) A Eat(b,f)) => Sick{a), Sick(b) 

However, statements like this are still tightly related to a truth-concept, 
that is, we classify statements in terms of these either being true or not. 



THE LOGIC OF QUANTUM MECHANICS - TAKE II 



■5 




... you could be next! 



NEW! 

You can now purchase TheorvMine T-shirts, mouse-pads and mugs! 




_i: i:;ir 



1 choose 
name 



a 



theorem 



2 discover 



a 



great gift 



|| 3 | THEY get a 




Figure 1. The theory [mine] website which allows one to buy an 
automatically generated theorem and name it after someone. It 
is a novelty gift spin-off from the automated theory exploration 
expertise at Edinburgh University — see [TS] for the science. 

Clearly there is a lot more to the meaning of a sentence than it either being 
true or false. This leads us to the following questions: What do we mean 
by meaning? What is the logic governing meaning, more specifically, how 
do meanings of words interact to form meanings of sentences? 

• Use: automated reasoning. Logic now forms the foundation for fields 
like automated proof checking and automated theorem proving in computer 
science, which are key to modern methods for verifying the correctness of 
new software and hardware. Logic also controls robot behaviors in artificial 
intelligence. Even more adventurous is automated theory exploration, where 
one does not only try to automatically proof theorems, but also generate 
them, which is a much harder task (cf. P vs. NP) — see also Figure [TJ 

Our diagrammatic framework appeals to both of these senses of logic, and in 
doing so produced important new applications in each of these areas: 

• The above depicted framework for modeling how meanings of words inter- 
act to form meanings of sentences, introduced by Clark, Sadrzadeh and the 
author in [20, 38] . is the first to do so based on a clear conceptual under- 
pinning. It was a cover heading feature in New Scientist [5. and meanwhile 
greatly improved performance of several natural language processing tasks 
|51j . We explain this framework in Section |4j as well as its structural rela- 
tionship the graphical quantum formalism. 

• The diagrammatic formalism underpins the automated reasoning software 
quantomatic developed at Oxford and Google by Dixon, Kissinger, Merry, 
Duncan, Soloviev and Frot — see also Figure [2j More recently, work on 
automated theory exploration of graphical theories also started at Oxford 
[55] . building further on the work done at Edinburgh [55]. We won't discuss 
this here; details are in |41H43j and on the quantomatic website. 

§2. Minimal process logic. By a process logic we simply mean any strict 
symmetric monoidal category, and by minimal we mean that at this stage we 
consider no structure (yet) other than the strict symmetric monoidal structure. 
We explain this structure in terms of its graphical language. 
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Figure 2. Screenshot of the quantomatic software developed 
in a collaboration between Oxford and Google, which can be 
downloaded from |http: / / sites.googlex om /site/quantomatic / 

We could as well have given a symbolic presentation. We refer the reader 
to [2U [33J for such a symbolic presentation, exemplified for the specific case of 
cooking processes, and how they compose to make up recipes — [24] also discusses 
how a process logic explains why tigers have stripes while lions don't. 

2.1. Graphical language. The data of a minimal process logic consists of 
processes, represented by boxes, each of which takes some type of systems as its 
input, represented by (an) input wire(s), and some type of system as its output: 

-output wire(s) 



Box := / 

input wire(s) 

These types may be compound, or trivial, i.e. representing 'no system': 
one system n sub -systems no system 



1 n 

Examples of types could be a particular quantum system, classical data of a 
certain size, grammatical types, e.g. the type of a noun, verb, or a sentence, 
etc. A process with no input wire is called a state — one can think of these as 
'preparation processes'. Those with neither an input type nor an output type 
are called values. A process without an output type is called a valuation. 

The connectives of a minimal process logic constitute composition of processes. 
There are two modes: sequential or causal or connected composition, and, parallel 
or acausal or disconnected composition, respectively depicted as: 
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Figure 3. Examples of quantum mechanical concepts that can 
be expressed in purely topological terms, with the help of some 
new graphical elements. They are taken from [231 26, 29, 351 137] . 



So by post-composing a state with a valuation one obtains a value. Note that 
sequential composition requires the output type of / to be equal to the input 
type of g while no such restriction exists for parallel composition. 

The formal paradigm underpinning minimal process logic is a topological one: 




The topology captures 'what interacts with what', a wire standing for interaction 
while no wire stands for no interaction. It is surprising how much concepts 
can be expressed purely in these topological terms — e.g. see Figure [3] for some 
topologically characterized quantum mechanical concepts. 

The computational content of minimal process logic boils down to the simple 
intuitive rule that topologically equivalent diagrams are equal. Hence computa- 
tion proceeds by topological deformations: 




There is no additional equational content to a minimal process logic. This 
may sound surprising, since a strict symmetric monoidal category is subject to 
a number of axioms. The explanation is that in the graphical language all these 
equations become tautologies. For example, denoting sequential composition by 
o and parallel composition by (g), the 'bifunctoriality equation' (g o /) ® (k o h) = 
(g ® k) o (/ ® h) of monoidal categories becomes: 
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In terms of processes this means that l g after /, while, k after h' is the same as 
'<? while k, after, / while h\ 



§3. Quantum process logic - Take Ila. Our next goal is to derive some 
non-trivial quantum phenomena by endowing a minimal process logic with a tiny 
bit of extra structure, identified by Abramsky and the author in [HE]- 

3.1. Dagger compact structure. The first bit of extra structure will induce 
some kind of metric on the states, namely, we will ask that each state can be 
turned into a valuation; applying this valuation to any other state will yield a 
value. Note that this is exactly how the highly successful Dirac notation [40 
works: a ket \ip) can be turned into a bra and when composing with 
another ket \<f>) we obtain a bra-ket (ip\<j>) i.e. an inner-product. Since states may 
themselve arise by composing processes other than states, we will allow for the 
inputs and the outputs of any process to be 'flipped': 

B A 

v[/\ll[/7 
A B 

Note that flipping twice yields the original box, so flipping is involutive, and it 
is also clear that it preserves parallel composition, while it reverses sequential 
composition. We refer to flipping as the adjoint or dagger)^] 

So far we haven't said anything specific about the parallel composition. Now 
we will truly follow Schrodinger's path and specify in which manner quantum 
systems compose differently than classical systems. In other words, we will assert 
that pure quantum states admit entanglement, diagrammatically: 



quantum 
classical 




That is, a quantum state of two systems can in general not be described by de- 
scribing the state of its parts. Note that this is also not the case for probabilistic 
classical data: a situation of two systems which comes with the promise that the 
states of the system are the same but unknown, can also not be described by 
independently describing the state of each system. However, in quantum theory 



4 From the perspective of Birkhoff-von Neumann quantum logic, one could conceive this as 
the analog to an orthocomplementation on the lattice structure. That is, an order-reversing 
involution. Note in particularly that for non-Boolean lattices an orthocomplementation is a 
structure, not a property, as there can exist many different ones on the same lattice. In lattice 
theoretic terms the linear algebraic adjoint indeed arises as an expression involving Galois 
adjoints (— )*and orthocomplementation (— )', namely /' (a) = 1301 1471 . 
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this already occurs for states on which there is no uncertainty, that is, for which 
there exists a measurement that yields a particular outcome with certainty. 

So how do we provide a constructive witness for the fact that the state of 
two systems does not 'disconnect' in two separate one-system states? Simply by 
explicitly introducing a special two system state which is obtained by connecting 
its two outputs with a cup-shaped wire: 

Sticking to our topological paradigm, such a cup-shaped state for example obeys: 

PJ = | 

The equivalent symbolic expression for this equation would be: 

(f^ ® 1) o (1 <g> \^_J) = 1 

where 1 stands for a single straight wire and ( ^ is obtained simply by flipping 

^ ) i.e. its adjoint. We obtain a strict dagger- compact category [TJ[2]. 

3.2. Deriving physical phenomena. We assumed the existence of an ad- 
joint for any box and represent it via flipping. Cup- and cap-shaped wires also 
enable us to 'define' the transpose which we depict by rotating a box 180°: 




It then immediately follows that we have: 




that is, we can slide boxes across cup- and cap-shaped wires. Going berserk, 




that is, we can treat the entire graphical calculus for dagger compact categories 
in terms of beads which slide on wires. Now for some physics. We have: 
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and we choose / such that its composite with its adjoint yields the identity, 
something to which we refer as unitarity. Hence: 




Introducing agents Alice and Bob yields quantum teleportation: 




Note that, given that the quantum mechanical formalism was born in 1932, 
that this phenomenon took 60 years to be discovered [T^] . The standard quantum 
mechanical formalism provides no indication whatsoever that something like this 
would be possible, so one had to rely on sheer luck to discover it. 

A more detailed discussion of this graphical derivation and its physical inter- 
pretation is in [5TJ [21] . Similarly we derive another quantum mechanical feature, 
the entanglement swapping protocol [79) : 




So how much quantum mechanics can we derive in this calculus? 

3.3. Logical completeness wrt Hilbert spaces. The diagrammatic lan- 
guage presented above is directly related to the symbolic notion of a dagger 
compact category as follows: 

Theorem 1 (Kelly-Laplaza; Selinger [58, 75 ). An equational statement between 
expressions in the dagger compact categorical language holds if and only if it is 
derivable in the above described graphical calculus. 

Evidently there are many dagger compact categories, to mention two: 
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• Wires represent finite dimensional Hilbert spaces, boxes linear maps, the 
dagger is the linear algebraic adjoint, sequential composition is ordinary 
function composition, and the parallel composition of wires is the tensor 
product while parallel composition of boxes is the Kronecker product. 

• Wires represent sets, boxes relations, the dagger is the relational converse, 
sequential composition is composition of relations, and parallel composition 
is the cartesian product. 

The description of the compact structure for each of these as well as some more 
examples can be found in |33| . Evidently these two examples have very different 
spaces and one would evidently not associate sets and relations with quantum 
processes. Hence one could could wonder how much one can actually derive in 
(the graphical calculus for) dagger compact categories. The answer is surprising. 

Theorem 2 (Hasegawa-Hofmann-Plotkin; Selinger [531 [75]). An equational state- 
ment between expressions in dagger compact categorical language holds if and 
only if it is derivable in the dagger compact category of finite dimensional Hilbert 
spaces, linear maps, tensor product and linear algebraic adjoints. 

To put this in more quantum physics related terms, any equation involving: 

• states, operations, effects, ... 

• Bell-state, Bell-effect, transposition, conjugation, ... 

• inner-product, linear-algebraic trace, Hilbert-Schmidt norm, ... 

• adjoints (e.g. self-adjointness and unitarity), projections, positivity, ... 

• complete positivity (cf. [75]), ... 

holds in quantum theory if and only if it can be derived in the graphical language. 

§4. Natural language process logic. Before continuing with the further 
development of quantum process logic, we turn our attention on something com- 
pletely different: meaning in natural language, in particular, the from-word- 
meaning-to-sentence-meaning process. Meaning here manifestly goes beyond 
simply assigning truth values to sentences. 

4.1. From word meaning to sentence meaning. Consider as given the 
meanings of words. This can mean many things, for example, one has a dictio- 
nary available. On the other hand, there are no dictionaries for entire sentences. 
So how de we know what a sentence means? There must be some kind of mecha- 
nism, used by all of us, for transforming the meaning of words into the meaning 
of a sentence, since surely, we all understand sentences that we may have never 
heard before in our lives, provided we understand all of its words. 

There is a technological side to this. Search engines such as google and other 
natural language processing tools also have an understanding of meanings of 
words which they use to provide us with the most relevant outputs for our 
queries. The model of word meaning which these engines employ enables them 
to produce outputs that include words that are closely related to the words in 
our query, i.e. there doesn't have to be an exact match. 

However, searching on Google for "I want something that allows me to go faster 
than when I only use my legs" returns among its top hits: "Difference Between 
Oxycontin and Oxycodone" , "What are good ways for a girl to [XXX]" , "How to 
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Sprint Faster: 6 steps - wikiHow', "My Story - Onelegtim.com - Retired Police 
Officer & [...]" and "Golf Swing Power: What Your Legs Should Be Doing [...]". 
Neither of these point me in the direction of appropriate vehicles that would serve 
my purpose, so clearly there is no understanding of the meaning of my query. 
The reason is the lack of a theory that produces the meaning of a sentence from 
the meanings of its words, whatever the manner is in which we describe the 
meaning of words. 

Now, representing grammatical types of words by wires and their meanings by 
state-boxes we can depict a string of words as: 



But the overall type, i.e. the overall wire structure, depends on the grammatical 
structure of the sentence. However, sentences with different grammatical struc- 
ture may have the same meaning, and more general, we would like to have a hxed 
type for the meaning of all sentences. Hence there is some process, the from- 
word-meaning-to-sentence-meaning process, which transforms the meanings of 
the string of words in the meaning of the sentence made up from these: 



What drives this process? That is, given a string of words, what mediates their 
interaction? The answer is obvious: 

I 



since grammatically incorrect sentences have no clear meaning anyway. 

We can now describe the problem for from- word-meaning-to-sentence-meaning 
processes in more precise terms: 

• Given a theory of word meaning, and given a theory of grammar, how 
can we combine these into an algorithm which produces the meaning of 
sentences from the meanings of its words? 

As already mentioned, this problem was addressed by Clark, Sadrzadeh and the 
author in [5D1 |3H]. Let's stay at an abstract level a bit longer, before we will 
describe concrete theories of word meaning and grammar. What is a verb? A 
transitive verb is something that requires an object and a subject in order to 
yield a grammatically correct sentence. So we can think of a transitive verb as 
a process with three wires, two respectively requiring an object and a subject, 
and one producing the sentence: 
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sentence 
♦ 




object ^ — subject 

Since we rather represent a verb as a state we can use transposition, as denned 
above, to turn inputs into outputs and represent the verb as: 

object ^ sent l nce ^subject 




You may ask where these cups suddenly come from, but here we already antici- 
pate the description of grammatical structure that we discuss below. 

Note in particular also that for these kinds of word-states we again have: 



since otherwise, for the case of a transitive verb, the meaning of the sentence 
would not depend on the meanings of the nouns, which could have dramatic 
consequences. For example, considering the verb 'hate', it would be sufficient for 
one person to hate another person in order for everyone to hate everyone. 

4.1.1. A theory for word meaning. The current dominant theory of word 
meaning for natural language processing tasks is the so-called distributional or 
vector space model of meaning [73 . It takes inspiration from Wittgensteins phi- 
losophy of meaning is use [78] , whereby meanings of words can be determined 
from their context, and works as follows. One fixes a collection of n words, the 
context words, and considers an n dimensional vector space with chosen basis 
where each basis vector represents one of the context words. Then one selects 
a huge body of written text, the corpus. E.g. the internet, all editions of a cer- 
tain newspaper, all novels, the British National Corpu^] which is a 100 million 
word collection of samples of written and spoken language from a wide range 
of sources, etc. Next one decides on a scope, that is, a small integer fc, and for 
each context word x one counts how many times N x (a) a word a to which one 
wants assign a meaning occurs at a distance of at most k words from x. One 
obtains a vector (N\(a),... ,N n (a)), which one normalizes in order to obtain 
(0i (a), . . . , 0„(a)), the meaning vector of a. Now, in order to compare meanings 
of words, in particular, how closely their meanings are related, one can simply 
compute the inner-product of their meaning vectors. 

4.1.2. A theory of grammar. Algebraic gadgets that govern grammatical types 
have been around for quite a bit longer [4j [10l [19j [62] . There are several vari- 
ants available, each with their pro's and con's; here we will focus on Lambek's 
pregroups [63 . Philosophically, these algebraic gadgets trace back to Freges prin- 
ciple that the meaning of a sentence is a function of the meaning of its parts [33] . 
However, this is only manifest in that these algebras all have a composition op- 
eration that allows to build larger strings of words from smaller strings of words. 



5 This can be accesses at http://www.natcorp.ox.ac.uk/ 
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These algebras also have a relation < where a ■ . . . ■ z < t means that the string 
of types a ... z has as its overall type t. For example, n ■ tv ■ n < s expresses the 
fact that a noun, a transitive verb and a noun make up a sentence s. Finally, 
there are additional operations subject to certain laws which make up the actual 
structure of the algebra, and these would allow one to derive correct statements 
such as n ■ tv ■ n < s. 

For the specific case of pregroups, these additional operations are a left inverse 
) and a right inverse (— ) , subject to x-~ 1 (x) < 1 and (a;) -1 -x < 1 where 
1 is the unit for the composition operation, as well as to 1 < (x) • x and 
1 < x ■ (x)^ 1 . Now we have to assign grammatical types to the elements of a 
pregroup. Some will be atomic, i.e. indecomposable, while others like transitive 
verbs will be assigned compound types. Concretely, tv — (n) ■ s ■ (n) , hence 

n ■ tv ■ n — n ■ • s ■ (rt) -1 ) • n = (n ■ ~ (n)) • s ■ ■ nj < 1 • s ■ 1 = s , 

so the string of types 'noun transitive verb noun' indeed makes up a grammat- 
ically correct sentence. We can depict this computation graphically as follows. 
We start with five systems of respective types n, ~ 1 (n), s, (n)^ 1 and n: 

1 ,A I 1,1 

n "'(n) s (n) _1 n 
Then, we use caps to indicate that n and _1 (n), and, (n) -1 and n, cancel out: 



so that at the end the only remaining system is the sentence type. The caps here 
represent the equations x ■ _1 (x) < 1 and ■ x < 1. In fact, this is not just an 

analogy with the graphical language of compact categories. Pregroups are in fact 
compact categories! To see this, any partial order is a category, the composition 
provides the tensor, and while equations x ■ ~ 1 (x) < 1 and (x) -1 ■ x < 1 provide 
caps, equations 1 < _1 (a;) • x and 1 < x ■ (a;)" 1 provide cups. More details on 
this are in [3S] . The reason that there are two kinds of caps and cups is the fact 
that we are not allowed to change the order of words in a sentence while two 
physical systems do not come with some ordering. In category-theoretic terms, 
here we are dealing with a non-symmetric tensor. 

4.2. Combining theories. The structural similarity between the pregroup 
theory of grammar, and the vector spaces for word meaning when organized as a 
dagger compact category, is exactly what we will exploit to explicitly construct 
the from-word-meaning-to-sentence-meaning process. We consider the graphical 
representation of the proof of grammatical correctness of a sentence, substitute 
the sentence types by meaning vectors of the particular words we are interested 
in, and substitute the caps by the vector space caps, so we obtain: 





-i 



n _1 (n) s 



(n) 



n 




where the dotted line indicates the linear map that when applied to the vector 
— r-?> — — > — f 

Alice <E) hates <E) Bob produces the vector that we take to be the meaning of a 
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sentence. By rewriting this using transposition, as in Section [4. 1| the verb now 
acts as a fuction on the object and the subject: 




The meanings of all sentences live in the same vector space so we can again 
simply use the inner-product to measure their similarity. Grefenstette and 
Sadrzadeh have recently exploited this theory for standard natural processing 
tasks and their method outperforms all existing ones [ST] . 

What about the cups? They can be used to model 'special words' like "does" 
and "not", which have a clear 'logical' meaning. Here is an example of this: 




[meaning vectors of words 



As above, the wire structure here is obtained from the types of these words 
according to the pregroup grammar. Using cups we can model the meaning of 
'does', that is, 'does nothing really', and 'not', that is, 'negates meaning', for 
which we use an input-output not-box that does just that: 




and then we can simply use homotopy to compute: 




which is exactly what we would expect the meaning of Alice doesn't liking Bob 
to be: the negation of Alice liking Bob. This example also shows how the wires 
are mediating the 'flow' of word meaning in sentences. They allow for the words 
Alice and like, while far apart in the sentence, to interact. 

Turning things upside-down, one can now ask the question: why are there 
algebraic gadgets that describe grammatical correctness, i.e. why do these even 
exist. Our theory of word meaning explains this: they witness the manner of 
how word meanings interact to form the meaning of a sentence. 

4.3. An aside: quantizing grammar. An interesting analogy arises, which 
was first observed by Louis Crane, and which is discussed in detail in |70j . An 
important area of contemporary mathematics is the study of Topological Quan- 
tum Field Theory (TQFT) [SHE]- While it takes its inspiration from quantum 
field theory, it has become an area of research in its own right, mainly within 
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topology. The object of study is a monoidal functor: 

F : nCob -> FVect K :: / \ k> V 




from the compact category of closed (n — l)-dimensional manifolds with dif- 
feomorphism classes of n-dimensional manifolds connecting the closed (n — 1)- 
dimensional manifolds as morphisms, to the compact category of vector spaces 
over some field kJ^] Now, rather than taking a category of topological structures 
as domain, we can take a pregroup as domain, i.e. a category of grammatical 
structures, and obtain a grammatical quantum field theory: 



F : Pregroup — > FVectj 



§5. Quantum process logic - Take lib. Dagger compact categories cap- 
ture a substantial number of quantum mechanical concepts, and the dagger com- 
pact category FdHilb related to the von Neumann model described in Section 
3.3| is complete with respect to them. But they are by no means universal with 
respect to quantum theory by which we can mean two different things: 

• that they do not capture all quantum mechanical concepts, and, 

• that the language is not rich enough to describe all processes in FdHilb. 

Examples of concepts that are not captured by dagger compact language are 
the classical data obtained in measurements, observables themselves, and rela- 
tionships between these e.g. complementarity. Examples of FdHilb-processes 
not expressible in dagger compact language are basic quantum computational 
gates such as the CNOT-gate, phase-gates etc. We will now present an extended 
graphical language which does capture all of these. This was established in a se- 
ries of papers by Pavlovic, Paquette, Duncan, and the author in [251 126"! IM1 13"5] . 
The calculus was also rich enough to address a number of concrete quantum 
computational and quantum foundational problems e.g. see [TH 1271 [251 B51 [55] . 

Rather than only allowing for wires we allow for 'dots' at which wires branch 
into multiple wires, or none. We refer to these dots as ... 



'spiders' — 



So what is the analogue the topological calculus with cups and caps, and in 
particular, eq.Q? Similarly to 'however one bends a wire, it still remains just a 
wire that acts as an identity', any web of spiders with the same overall number of 
inputs and outputs, independent of how the web is build up, is again the same. 




If the field has a non-trivial involution then this category has a dagger too (7^ transposition). 
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So for any k > 0: 



m+m 





n+n' — k 

Hence, the rule governing spider calculus is that if two spiders 'shake legs', they 
fuse together. Again in other words, it only matters what is connected to what, 
but not the manner in which this connection is realized. 
This in particular implies that for the specific spiders: 




and 




we obtain eq.Q: 



0+2-1 




2+0-1 

so reasoning with spiders strictly generalizes reasoning with wires. 

In FdHilb a family of spiders of the above kind on-the-nose captures an 
orthonormal basis, which is a non-trival result. Firstly, one can show that rea- 
soning with those spiders is equivalent to working with a so-called dagger special 
commutative Frobenius algebra [3H [SHI HI] • Next one shows that these dagger 
special commutative Frobenius algebras in FdHilb are the same thing as or- 
thonormal bases Since bases allow to represent observables and classical 
data, we almost reached our goal, except for the fact that quantum theory only 
becomes interesting if we consider several 'incompatible' bases. 

So now we consider two different families of spiders, represented by a different 
gray scale. What happens if a dark gray and a light gray spider which represent 
complementary observables 'shake leggs'? Well, their 'legs fall off': 
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This was shown by Duncan and the author in 126) . Such a pair of differently 
colored spider families that interact in this manner forms the basis of a rich 
calculus with many more extra features than the ones described here. We refer 
the interested reader to [211 EH El] for more details and concrete applications. 

§6. The remaining challenge. In this paper we pushed forward the idea 
that the diagrammatic languages describing quantum phenomena as well as 
meaning-related linguistic phenomena may constitute some new kind of quanti- 
tative logic. The same logic also governs Bayesian inference, Bayesian inversion 
boiling down to nothing but transposition for appropriately chose cups and caps: 



This was established by Spekkens and the author in 39J, to which we refer for 
details. So where does traditional logic fit into this picture? 

One perspective is to start with standard categorical logic [31 02 IMj- The 
compact structure can then be seen as a resource sensitive variant (as in Linear 
Logic |50U74| ) which is degenerate in the sense that conjunction and disjunction 
coincide [22 , 44] . We do not subscribe (anymore) to conceiving the diagrammatic 
logic as a 'degenerate hyper-deductive variant' of standard logic in categorical 
form since this does not recognize the quantitative nor the process content. 

Rather, we would like to conceive the quantitative diagrammatic logic as 'the 
default thing' from which traditional qualitative logic arises via some kind of 
structural collapse. There are several results that could be taken as a starting 
point in this direction, for example, the generalization in [34] of Carboni and 
Walters' axiomatization of the category of relations [16 . But since this still 
belongs to the world of speculation, we leave this to future writings. 
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